Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 1529 |
| Missing cells | 577 |
| Missing cells (%) | 2.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 802.5 KiB |
| Average record size in memory | 537.4 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 6 |
DataYear has constant value "2015" | Constant |
LargestPropertyUseType has a high cardinality: 56 distinct values | High cardinality |
Location has a high cardinality: 1493 distinct values | High cardinality |
LargestPropertyUseType is highly correlated with DataYear | High correlation |
NumberofBuildings is highly correlated with DataYear | High correlation |
DataYear is highly correlated with LargestPropertyUseType and 3 other fields | High correlation |
Neighborhood is highly correlated with DataYear | High correlation |
PrimaryPropertyType is highly correlated with DataYear | High correlation |
LargestPropertyUseType has 61 (4.0%) missing values | Missing |
ENERGYSTARScore has 507 (33.2%) missing values | Missing |
Location is uniformly distributed | Uniform |
OSEBuildingID has unique values | Unique |
PropertyGFAParking has 1187 (77.6%) zeros | Zeros |
Reproduction
| Analysis started | 2021-02-23 15:17:11.786495 |
|---|---|
| Analysis finished | 2021-02-23 15:17:42.699360 |
| Duration | 30.91 seconds |
| Software version | pandas-profiling v2.10.0 |
| Download configuration | config.yaml |
| Distinct | 1529 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15756.91629 |
|---|---|
| Minimum | 1 |
| Maximum | 50038 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 133.6 |
| Q1 | 602 |
| median | 21140 |
| Q3 | 24514 |
| 95-th percentile | 28320 |
| Maximum | 50038 |
| Range | 50037 |
| Interquartile range (IQR) | 23912 |
Descriptive statistics
| Standard deviation | 12924.61682 |
|---|---|
| Coefficient of variation (CV) | 0.8202503958 |
| Kurtosis | -0.6050413123 |
| Mean | 15756.91629 |
| Median Absolute Deviation (MAD) | 5460 |
| Skewness | 0.1435026679 |
| Sum | 24092325 |
| Variance | 167045719.9 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 21723 | 1 | 0.1% |
| 21219 | 1 | 0.1% |
| 27882 | 1 | 0.1% |
| 25001 | 1 | 0.1% |
| 22504 | 1 | 0.1% |
| 25543 | 1 | 0.1% |
| 21233 | 1 | 0.1% |
| 23951 | 1 | 0.1% |
| 23928 | 1 | 0.1% |
| Other values (1519) | 1519 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 15 | 1 |
| Value | Count | Frequency (%) |
| 50038 | 1 | |
| 50002 | 1 | |
| 49998 | 1 | |
| 49985 | 1 | |
| 49966 | 1 | |
| 49958 | 1 | |
| 49946 | 1 | |
| 49945 | 1 | |
| 49940 | 1 | |
| 49926 | 1 |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 91.2 KiB |
| 2015 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 6116 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
| Value | Count | Frequency (%) |
| 2015 | 1529 |
| Value | Count | Frequency (%) |
| 2015 | 1529 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1529 | |
| 0 | 1529 | |
| 1 | 1529 | |
| 5 | 1529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6116 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 1529 | |
| 0 | 1529 | |
| 1 | 1529 | |
| 5 | 1529 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6116 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 1529 | |
| 0 | 1529 | |
| 1 | 1529 | |
| 5 | 1529 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6116 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 1529 | |
| 0 | 1529 | |
| 1 | 1529 | |
| 5 | 1529 |
| Distinct | 24 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 110.5 KiB |
| Small- and Mid-Sized Office | |
|---|---|
| Other | |
| Non-Refrigerated Warehouse | |
| Large Office | |
| Mixed Use Property | |
| Other values (19) |
Length
| Max length | 27 |
|---|---|
| Median length | 18 |
| Mean length | 16.90451275 |
| Min length | 5 |
Characters and Unicode
| Total characters | 25847 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Hotel |
|---|---|
| 2nd row | Hotel |
| 3rd row | Hotel |
| 4th row | Hotel |
| 5th row | Hotel |
| Value | Count | Frequency (%) |
| Small- and Mid-Sized Office | 296 | |
| Other | 243 | |
| Non-Refrigerated Warehouse | 187 | |
| Large Office | 170 | |
| Mixed Use Property | 103 | 6.7% |
| Retail Store | 100 | 6.5% |
| Hotel | 73 | 4.8% |
| Worship Facility | 72 | 4.7% |
| Distribution Center | 51 | 3.3% |
| Medical Office | 43 | 2.8% |
| Other values (14) | 191 |
| Value | Count | Frequency (%) |
| office | 509 | |
| small | 296 | 8.7% |
| and | 296 | 8.7% |
| mid-sized | 296 | 8.7% |
| other | 243 | 7.1% |
| warehouse | 200 | 5.9% |
| non-refrigerated | 187 | 5.5% |
| large | 170 | 5.0% |
| store | 136 | 4.0% |
| mixed | 103 | 3.0% |
| Other values (25) | 982 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3292 | 12.7% |
| i | 2091 | 8.1% |
| 1889 | 7.3% | |
| r | 1802 | 7.0% |
| a | 1538 | 6.0% |
| t | 1262 | 4.9% |
| d | 1249 | 4.8% |
| f | 1247 | 4.8% |
| o | 1063 | 4.1% |
| l | 1049 | 4.1% |
| Other values (35) | 9365 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19177 | |
| Uppercase Letter | 3701 | 14.3% |
| Space Separator | 1889 | 7.3% |
| Dash Punctuation | 847 | 3.3% |
| Control | 88 | 0.3% |
| Decimal Number | 78 | 0.3% |
| Other Punctuation | 67 | 0.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 3292 | |
| i | 2091 | |
| r | 1802 | |
| a | 1538 | |
| t | 1262 | 6.6% |
| d | 1249 | 6.5% |
| f | 1247 | 6.5% |
| o | 1063 | 5.5% |
| l | 1049 | 5.5% |
| c | 742 | 3.9% |
| Other values (14) | 3842 |
| Value | Count | Frequency (%) |
| S | 878 | |
| O | 752 | |
| M | 443 | |
| R | 327 | 8.8% |
| W | 272 | 7.3% |
| N | 187 | 5.1% |
| L | 172 | 4.6% |
| U | 119 | 3.2% |
| C | 107 | 2.9% |
| P | 103 | 2.8% |
| Other values (5) | 341 | 9.2% |
| Value | Count | Frequency (%) |
| 1 | 39 | |
| 2 | 39 |
| Value | Count | Frequency (%) |
| 1889 |
| Value | Count | Frequency (%) |
| / | 67 |
| Value | Count | Frequency (%) |
| - | 847 |
| Value | Count | Frequency (%) |
| 88 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22878 | |
| Common | 2969 | 11.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 3292 | |
| i | 2091 | 9.1% |
| r | 1802 | 7.9% |
| a | 1538 | 6.7% |
| t | 1262 | 5.5% |
| d | 1249 | 5.5% |
| f | 1247 | 5.5% |
| o | 1063 | 4.6% |
| l | 1049 | 4.6% |
| S | 878 | 3.8% |
| Other values (29) | 7407 |
| Value | Count | Frequency (%) |
| 1889 | ||
| - | 847 | |
| 88 | 3.0% | |
| / | 67 | 2.3% |
| 1 | 39 | 1.3% |
| 2 | 39 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25847 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 3292 | 12.7% |
| i | 2091 | 8.1% |
| 1889 | 7.3% | |
| r | 1802 | 7.0% |
| a | 1538 | 6.0% |
| t | 1262 | 4.9% |
| d | 1249 | 4.8% |
| f | 1247 | 4.8% |
| o | 1063 | 4.1% |
| l | 1049 | 4.1% |
| Other values (35) | 9365 |
YearBuilt
Real number (ℝ≥0)
| Distinct | 112 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1960.390451 |
|---|---|
| Minimum | 1900 |
| Maximum | 2014 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1906 |
| Q1 | 1929 |
| median | 1965 |
| Q3 | 1988 |
| 95-th percentile | 2008 |
| Maximum | 2014 |
| Range | 114 |
| Interquartile range (IQR) | 59 |
Descriptive statistics
| Standard deviation | 32.87203394 |
|---|---|
| Coefficient of variation (CV) | 0.01676810552 |
| Kurtosis | -1.095019394 |
| Mean | 1960.390451 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | -0.2630421728 |
| Sum | 2997437 |
| Variance | 1080.570616 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1900 | 45 | 2.9% |
| 1979 | 30 | 2.0% |
| 2000 | 30 | 2.0% |
| 1910 | 30 | 2.0% |
| 1960 | 28 | 1.8% |
| 1970 | 28 | 1.8% |
| 1926 | 27 | 1.8% |
| 1962 | 26 | 1.7% |
| 1928 | 26 | 1.7% |
| 2008 | 25 | 1.6% |
| Other values (102) | 1234 |
| Value | Count | Frequency (%) |
| 1900 | 45 | |
| 1901 | 2 | 0.1% |
| 1902 | 9 | 0.6% |
| 1903 | 3 | 0.2% |
| 1904 | 13 | 0.9% |
| 1905 | 4 | 0.3% |
| 1906 | 14 | 0.9% |
| 1907 | 12 | 0.8% |
| 1908 | 10 | 0.7% |
| 1909 | 15 | 1.0% |
| Value | Count | Frequency (%) |
| 2014 | 9 | 0.6% |
| 2013 | 13 | |
| 2012 | 6 | 0.4% |
| 2011 | 2 | 0.1% |
| 2010 | 6 | 0.4% |
| 2009 | 23 | |
| 2008 | 25 | |
| 2007 | 9 | 0.6% |
| 2006 | 16 | |
| 2005 | 13 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 86.7 KiB |
| 1 | |
|---|---|
| 7 | 2 |
| 3 | 1 |
| 2 | 1 |
| 6 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1529 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 1524 | |
| 7 | 2 | 0.1% |
| 3 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 6 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1524 | |
| 7 | 2 | 0.1% |
| 3 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 6 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1524 | |
| 7 | 2 | 0.1% |
| 3 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 6 | 1 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1529 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 1524 | |
| 7 | 2 | 0.1% |
| 3 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 6 | 1 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1529 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 1524 | |
| 7 | 2 | 0.1% |
| 3 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 6 | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1529 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 1524 | |
| 7 | 2 | 0.1% |
| 3 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 6 | 1 | 0.1% |
NumberofFloors
Real number (ℝ≥0)
| Distinct | 45 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 7 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.29303548 |
|---|---|
| Minimum | 0 |
| Maximum | 99 |
| Zeros | 5 |
| Zeros (%) | 0.3% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 13.95 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 6.794112496 |
|---|---|
| Coefficient of variation (CV) | 1.582589412 |
| Kurtosis | 49.58480487 |
| Mean | 4.29303548 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 5.884680254 |
| Sum | 6534 |
| Variance | 46.15996461 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 412 | |
| 2 | 350 | |
| 3 | 244 | |
| 4 | 143 | 9.4% |
| 5 | 99 | 6.5% |
| 6 | 83 | 5.4% |
| 7 | 33 | 2.2% |
| 8 | 21 | 1.4% |
| 11 | 18 | 1.2% |
| 10 | 16 | 1.0% |
| Other values (35) | 103 | 6.7% |
| Value | Count | Frequency (%) |
| 0 | 5 | 0.3% |
| 1 | 412 | |
| 2 | 350 | |
| 3 | 244 | |
| 4 | 143 | 9.4% |
| 5 | 99 | 6.5% |
| 6 | 83 | 5.4% |
| 7 | 33 | 2.2% |
| 8 | 21 | 1.4% |
| 9 | 8 | 0.5% |
| Value | Count | Frequency (%) |
| 99 | 1 | 0.1% |
| 76 | 1 | 0.1% |
| 63 | 1 | 0.1% |
| 56 | 1 | 0.1% |
| 55 | 1 | 0.1% |
| 49 | 1 | 0.1% |
| 47 | 1 | 0.1% |
| 46 | 1 | 0.1% |
| 42 | 5 | |
| 41 | 2 | 0.1% |
PropertyGFABuilding(s)
Real number (ℝ)
| Distinct | 1447 |
|---|---|
| Distinct (%) | 94.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96319.22629 |
|---|---|
| Minimum | -50550 |
| Maximum | 1765970 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | -50550 |
|---|---|
| 5-th percentile | 21027.6 |
| Q1 | 27788 |
| median | 45082 |
| Q3 | 90266 |
| 95-th percentile | 332731.2 |
| Maximum | 1765970 |
| Range | 1816520 |
| Interquartile range (IQR) | 62478 |
Descriptive statistics
| Standard deviation | 164287.3499 |
|---|---|
| Coefficient of variation (CV) | 1.705654792 |
| Kurtosis | 31.97702714 |
| Mean | 96319.22629 |
| Median Absolute Deviation (MAD) | 20882 |
| Skewness | 5.051704685 |
| Sum | 147272097 |
| Variance | 2.699033333 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 21600 | 8 | 0.5% |
| 28800 | 7 | 0.5% |
| 25920 | 7 | 0.5% |
| 36000 | 6 | 0.4% |
| 24000 | 5 | 0.3% |
| 33300 | 3 | 0.2% |
| 22388 | 2 | 0.1% |
| 56700 | 2 | 0.1% |
| 27800 | 2 | 0.1% |
| 38038 | 2 | 0.1% |
| Other values (1437) | 1485 |
| Value | Count | Frequency (%) |
| -50550 | 1 | |
| -43310 | 1 | |
| 10925 | 1 | |
| 12806 | 1 | |
| 15000 | 1 | |
| 16200 | 1 | |
| 17824 | 1 | |
| 17956 | 1 | |
| 18396 | 1 | |
| 19193 | 1 |
| Value | Count | Frequency (%) |
| 1765970 | 1 | |
| 1632820 | 1 | |
| 1400000 | 1 | |
| 1380959 | 1 | |
| 1323055 | 1 | |
| 1295457 | 1 | |
| 1258280 | 1 | |
| 1215718 | 1 | |
| 1172127 | 1 | |
| 1158691 | 1 |
| Distinct | 335 |
|---|---|
| Distinct (%) | 21.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14330.77436 |
|---|---|
| Minimum | -2 |
| Maximum | 512608 |
| Zeros | 1187 |
| Zeros (%) | 77.6% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 98093.6 |
| Maximum | 512608 |
| Range | 512610 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 45245.70688 |
|---|---|
| Coefficient of variation (CV) | 3.157240895 |
| Kurtosis | 34.20368265 |
| Mean | 14330.77436 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.15909704 |
| Sum | 21911754 |
| Variance | 2047173991 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1187 | |
| 13320 | 3 | 0.2% |
| 25920 | 2 | 0.1% |
| 25800 | 2 | 0.1% |
| 100176 | 2 | 0.1% |
| 20416 | 2 | 0.1% |
| 30000 | 2 | 0.1% |
| 10800 | 2 | 0.1% |
| 124216 | 1 | 0.1% |
| 52582 | 1 | 0.1% |
| Other values (325) | 325 | 21.3% |
| Value | Count | Frequency (%) |
| -2 | 1 | 0.1% |
| 0 | 1187 | |
| 1263 | 1 | 0.1% |
| 1392 | 1 | 0.1% |
| 2211 | 1 | 0.1% |
| 2352 | 1 | 0.1% |
| 3764 | 1 | 0.1% |
| 3834 | 1 | 0.1% |
| 4256 | 1 | 0.1% |
| 4553 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 512608 | 1 | |
| 440185 | 1 | |
| 407795 | 1 | |
| 389860 | 1 | |
| 368980 | 1 | |
| 335109 | 1 | |
| 327680 | 1 | |
| 319400 | 1 | |
| 303707 | 1 | |
| 297457 | 1 |
| Distinct | 56 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 61 |
| Missing (%) | 4.0% |
| Memory size | 103.2 KiB |
| Office | |
|---|---|
| Non-Refrigerated Warehouse | |
| Retail Store | |
| Other | |
| Worship Facility | |
| Other values (51) |
Length
| Max length | 52 |
|---|---|
| Median length | 11 |
| Mean length | 13.60490463 |
| Min length | 5 |
Characters and Unicode
| Total characters | 19972 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Hotel |
|---|---|
| 2nd row | Hotel |
| 3rd row | Hotel |
| 4th row | Hotel |
| 5th row | Hotel |
| Value | Count | Frequency (%) |
| Office | 477 | |
| Non-Refrigerated Warehouse | 194 | |
| Retail Store | 97 | 6.3% |
| Other | 92 | 6.0% |
| Worship Facility | 70 | 4.6% |
| Hotel | 68 | 4.4% |
| Distribution Center | 52 | 3.4% |
| Medical Office | 43 | 2.8% |
| K-12 School | 39 | 2.6% |
| Supermarket/Grocery Store | 37 | 2.4% |
| Other values (46) | 299 | |
| (Missing) | 61 | 4.0% |
| Value | Count | Frequency (%) |
| office | 523 | |
| warehouse | 206 | 8.6% |
| non-refrigerated | 194 | 8.1% |
| other | 155 | 6.5% |
| store | 134 | 5.6% |
| facility | 98 | 4.1% |
| retail | 97 | 4.0% |
| 72 | 3.0% | |
| worship | 70 | 2.9% |
| hotel | 68 | 2.8% |
| Other values (80) | 779 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2788 | |
| i | 1695 | 8.5% |
| r | 1562 | 7.8% |
| t | 1302 | 6.5% |
| f | 1300 | 6.5% |
| o | 1078 | 5.4% |
| a | 1039 | 5.2% |
| 928 | 4.6% | |
| c | 893 | 4.5% |
| O | 691 | 3.5% |
| Other values (41) | 6696 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15772 | |
| Uppercase Letter | 2673 | 13.4% |
| Space Separator | 928 | 4.6% |
| Dash Punctuation | 325 | 1.6% |
| Other Punctuation | 164 | 0.8% |
| Decimal Number | 78 | 0.4% |
| Open Punctuation | 16 | 0.1% |
| Close Punctuation | 16 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| O | 691 | |
| R | 365 | |
| S | 336 | |
| W | 277 | |
| N | 194 | 7.3% |
| C | 131 | 4.9% |
| H | 122 | 4.6% |
| F | 107 | 4.0% |
| M | 92 | 3.4% |
| D | 79 | 3.0% |
| Other values (11) | 279 |
| Value | Count | Frequency (%) |
| e | 2788 | |
| i | 1695 | |
| r | 1562 | |
| t | 1302 | |
| f | 1300 | |
| o | 1078 | 6.8% |
| a | 1039 | 6.6% |
| c | 893 | 5.7% |
| l | 654 | 4.1% |
| n | 605 | 3.8% |
| Other values (11) | 2856 |
| Value | Count | Frequency (%) |
| / | 135 | |
| , | 20 | 12.2% |
| & | 9 | 5.5% |
| Value | Count | Frequency (%) |
| 1 | 39 | |
| 2 | 39 |
| Value | Count | Frequency (%) |
| 928 |
| Value | Count | Frequency (%) |
| - | 325 |
| Value | Count | Frequency (%) |
| ( | 16 |
| Value | Count | Frequency (%) |
| ) | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18445 | |
| Common | 1527 | 7.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 2788 | |
| i | 1695 | 9.2% |
| r | 1562 | 8.5% |
| t | 1302 | 7.1% |
| f | 1300 | 7.0% |
| o | 1078 | 5.8% |
| a | 1039 | 5.6% |
| c | 893 | 4.8% |
| O | 691 | 3.7% |
| l | 654 | 3.5% |
| Other values (32) | 5443 |
| Value | Count | Frequency (%) |
| 928 | ||
| - | 325 | 21.3% |
| / | 135 | 8.8% |
| 1 | 39 | 2.6% |
| 2 | 39 | 2.6% |
| , | 20 | 1.3% |
| ( | 16 | 1.0% |
| ) | 16 | 1.0% |
| & | 9 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19972 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 2788 | |
| i | 1695 | 8.5% |
| r | 1562 | 7.8% |
| t | 1302 | 6.5% |
| f | 1300 | 6.5% |
| o | 1078 | 5.4% |
| a | 1039 | 5.2% |
| 928 | 4.6% | |
| c | 893 | 4.5% |
| O | 691 | 3.5% |
| Other values (41) | 6696 |
| Distinct | 100 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 507 |
| Missing (%) | 33.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62.37475538 |
|---|---|
| Minimum | 1 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 43 |
| median | 69 |
| Q3 | 86 |
| 95-th percentile | 98 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 43 |
Descriptive statistics
| Standard deviation | 29.06437633 |
|---|---|
| Coefficient of variation (CV) | 0.4659637725 |
| Kurtosis | -0.7631450277 |
| Mean | 62.37475538 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.6175145488 |
| Sum | 63747 |
| Variance | 844.7379713 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 31 | 2.0% |
| 1 | 26 | 1.7% |
| 95 | 25 | 1.6% |
| 81 | 24 | 1.6% |
| 89 | 23 | 1.5% |
| 86 | 21 | 1.4% |
| 91 | 21 | 1.4% |
| 93 | 20 | 1.3% |
| 77 | 20 | 1.3% |
| 98 | 20 | 1.3% |
| Other values (90) | 791 | |
| (Missing) | 507 |
| Value | Count | Frequency (%) |
| 1 | 26 | |
| 2 | 7 | 0.5% |
| 3 | 6 | 0.4% |
| 4 | 9 | 0.6% |
| 5 | 2 | 0.1% |
| 6 | 4 | 0.3% |
| 7 | 6 | 0.4% |
| 8 | 10 | 0.7% |
| 9 | 3 | 0.2% |
| 10 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 100 | 31 | |
| 99 | 19 | |
| 98 | 20 | |
| 97 | 19 | |
| 96 | 6 | 0.4% |
| 95 | 25 | |
| 94 | 19 | |
| 93 | 20 | |
| 92 | 18 | |
| 91 | 21 |
SiteEnergyUse(kBtu)
Real number (ℝ≥0)
| Distinct | 1526 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7553921.257 |
|---|---|
| Minimum | 0 |
| Maximum | 295812640 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 400782.7 |
| Q1 | 1149832 |
| median | 2508795 |
| Q3 | 6994638.5 |
| 95-th percentile | 28845309.4 |
| Maximum | 295812640 |
| Range | 295812640 |
| Interquartile range (IQR) | 5844806.5 |
Descriptive statistics
| Standard deviation | 18537172.75 |
|---|---|
| Coefficient of variation (CV) | 2.453980141 |
| Kurtosis | 126.6082797 |
| Mean | 7553921.257 |
| Median Absolute Deviation (MAD) | 1749527.5 |
| Skewness | 9.535819006 |
| Sum | 1.154239168 × 1010 |
| Variance | 3.436267736 × 1014 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2 | 0.1% |
| 2074152 | 2 | 0.1% |
| 2721954 | 1 | 0.1% |
| 13253979 | 1 | 0.1% |
| 11511880 | 1 | 0.1% |
| 11026945 | 1 | 0.1% |
| 2202114 | 1 | 0.1% |
| 1525624 | 1 | 0.1% |
| 1820292 | 1 | 0.1% |
| 1456039 | 1 | 0.1% |
| Other values (1516) | 1516 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 11441 | 1 | |
| 17150 | 1 | |
| 24126 | 1 | |
| 43943 | 1 | |
| 53401 | 1 | |
| 56493 | 1 | |
| 82824 | 1 | |
| 91996 | 1 | |
| 93802 | 1 |
| Value | Count | Frequency (%) |
| 295812640 | 1 | |
| 286685536 | 1 | |
| 284867168 | 1 | |
| 251191824 | 1 | |
| 137635696 | 1 | |
| 104977248 | 1 | |
| 94560088 | 1 | |
| 94178648 | 1 | |
| 85357952 | 1 | |
| 84980760 | 1 |
TotalGHGEmissions
Real number (ℝ≥0)
| Distinct | 1462 |
|---|---|
| Distinct (%) | 95.7% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 162.7709097 |
|---|---|
| Minimum | 0 |
| Maximum | 11824.89 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.6905 |
| Q1 | 18.4925 |
| median | 46.92 |
| Q3 | 135.0325 |
| 95-th percentile | 564.9955 |
| Maximum | 11824.89 |
| Range | 11824.89 |
| Interquartile range (IQR) | 116.54 |
Descriptive statistics
| Standard deviation | 557.2669998 |
|---|---|
| Coefficient of variation (CV) | 3.423627728 |
| Kurtosis | 243.3293813 |
| Mean | 162.7709097 |
| Median Absolute Deviation (MAD) | 35.83 |
| Skewness | 13.6927156 |
| Sum | 248713.95 |
| Variance | 310546.509 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.19 | 3 | 0.2% |
| 6.71 | 3 | 0.2% |
| 0 | 2 | 0.1% |
| 4.62 | 2 | 0.1% |
| 19.95 | 2 | 0.1% |
| 3.31 | 2 | 0.1% |
| 12.71 | 2 | 0.1% |
| 42.41 | 2 | 0.1% |
| 48.6 | 2 | 0.1% |
| 29.26 | 2 | 0.1% |
| Other values (1452) | 1506 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 0.08 | 1 | |
| 0.12 | 1 | |
| 0.17 | 1 | |
| 0.31 | 1 | |
| 0.35 | 1 | |
| 0.37 | 1 | |
| 0.64 | 1 | |
| 0.65 | 1 | |
| 0.66 | 1 |
| Value | Count | Frequency (%) |
| 11824.89 | 1 | |
| 10780.64 | 1 | |
| 8046.7 | 1 | |
| 4725.43 | 1 | |
| 3894.01 | 1 | |
| 3321.02 | 1 | |
| 3044.63 | 1 | |
| 2937.83 | 1 | |
| 2846.07 | 1 | |
| 2452.86 | 1 |
| Distinct | 1493 |
|---|---|
| Distinct (%) | 97.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 320.6 KiB |
| {'latitude': '47.66375728', 'longitude': '-122.3002168', 'human_address': '{"address": "2623 NE UNIVERSITY VILLAGE ST", "city": "SEATTLE", "state": "WA", "zip": "98105"}'} | 5 |
|---|---|
| {'latitude': '47.52593209', 'longitude': '-122.3308402', 'human_address': '{"address": "309 S CLOVERDALE ST", "city": "SEATTLE", "state": "WA", "zip": "98108"}'} | 5 |
| {'latitude': '47.52131741', 'longitude': '-122.3668974', 'human_address': '{"address": "2600 SW BARTON ST", "city": "SEATTLE", "state": "WA", "zip": "98126"}'} | 4 |
| {'latitude': '47.5829049', 'longitude': '-122.3228994', 'human_address': '{"address": "2203 AIRPORT WAY S", "city": "SEATTLE", "state": "WA", "zip": "98134"}'} | 4 |
| {'latitude': '47.5616226', 'longitude': '-122.3386303', 'human_address': '{"address": "4634 E MARGINAL WAY S", "city": "SEATTLE", "state": "WA", "zip": "98134"}'} | 3 |
| Other values (1488) |
Length
| Max length | 176 |
|---|---|
| Median length | 157 |
| Mean length | 157.6357096 |
| Min length | 151 |
Characters and Unicode
| Total characters | 241025 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1472 ? |
|---|---|
| Unique (%) | 96.3% |
Sample
| 1st row | {'latitude': '47.61219025', 'longitude': '-122.33799744', 'human_address': '{"address": "405 OLIVE WAY", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} |
|---|---|
| 2nd row | {'latitude': '47.61310583', 'longitude': '-122.33335756', 'human_address': '{"address": "724 PINE ST", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} |
| 3rd row | {'latitude': '47.61334897', 'longitude': '-122.33769944', 'human_address': '{"address": "1900 5TH AVE", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} |
| 4th row | {'latitude': '47.61421585', 'longitude': '-122.33660889', 'human_address': '{"address": "620 STEWART ST", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} |
| 5th row | {'latitude': '47.6137544', 'longitude': '-122.3409238', 'human_address': '{"address": "401 LENORA ST", "city": "SEATTLE", "state": "WA", "zip": "98121"}'} |
| Value | Count | Frequency (%) |
| {'latitude': '47.66375728', 'longitude': '-122.3002168', 'human_address': '{"address": "2623 NE UNIVERSITY VILLAGE ST", "city": "SEATTLE", "state": "WA", "zip": "98105"}'} | 5 | 0.3% |
| {'latitude': '47.52593209', 'longitude': '-122.3308402', 'human_address': '{"address": "309 S CLOVERDALE ST", "city": "SEATTLE", "state": "WA", "zip": "98108"}'} | 5 | 0.3% |
| {'latitude': '47.52131741', 'longitude': '-122.3668974', 'human_address': '{"address": "2600 SW BARTON ST", "city": "SEATTLE", "state": "WA", "zip": "98126"}'} | 4 | 0.3% |
| {'latitude': '47.5829049', 'longitude': '-122.3228994', 'human_address': '{"address": "2203 AIRPORT WAY S", "city": "SEATTLE", "state": "WA", "zip": "98134"}'} | 4 | 0.3% |
| {'latitude': '47.5616226', 'longitude': '-122.3386303', 'human_address': '{"address": "4634 E MARGINAL WAY S", "city": "SEATTLE", "state": "WA", "zip": "98134"}'} | 3 | 0.2% |
| {'latitude': '47.62124083', 'longitude': '-122.3534322', 'human_address': '{"address": "305 HARRISON ST", "city": "SEATTLE", "state": "WA", "zip": "98109"}'} | 3 | 0.2% |
| {'latitude': '47.5309583', 'longitude': '-122.3320685', 'human_address': '{"address": "121 S KENYON ST", "city": "SEATTLE", "state": "WA", "zip": "98108"}'} | 3 | 0.2% |
| {'latitude': '47.64171214', 'longitude': '-122.3173859', 'human_address': '{"address": "2400 11TH AVE E", "city": "SEATTLE", "state": "WA", "zip": "98102"}'} | 3 | 0.2% |
| {'latitude': '47.60387657', 'longitude': '-122.33374327', 'human_address': '{"address": "818 2ND AVE", "city": "SEATTLE", "state": "WA", "zip": "98104"}'} | 3 | 0.2% |
| {'latitude': '47.59845416', 'longitude': '-122.300978', 'human_address': '{"address": "2309 S JACKSON ST", "city": "SEATTLE", "state": "WA", "zip": "98144"}'} | 2 | 0.1% |
| Other values (1483) | 1494 |
| Value | Count | Frequency (%) |
| city | 1542 | 6.4% |
| zip | 1529 | 6.3% |
| longitude | 1529 | 6.3% |
| wa | 1529 | 6.3% |
| address | 1529 | 6.3% |
| seattle | 1529 | 6.3% |
| human_address | 1529 | 6.3% |
| state | 1529 | 6.3% |
| latitude | 1529 | 6.3% |
| ave | 868 | 3.6% |
| Other values (4178) | 9501 |
Most occurring characters
| Value | Count | Frequency (%) |
| " | 24464 | 10.1% |
| 22614 | 9.4% | |
| ' | 18348 | 7.6% |
| : | 10703 | 4.4% |
| t | 9174 | 3.8% |
| d | 9174 | 3.8% |
| a | 7645 | 3.2% |
| e | 7645 | 3.2% |
| , | 7645 | 3.2% |
| s | 7645 | 3.2% |
| Other values (53) | 115968 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 73392 | |
| Other Punctuation | 64218 | |
| Decimal Number | 44817 | |
| Uppercase Letter | 26810 | 11.1% |
| Space Separator | 22614 | 9.4% |
| Open Punctuation | 3058 | 1.3% |
| Close Punctuation | 3058 | 1.3% |
| Dash Punctuation | 1529 | 0.6% |
| Connector Punctuation | 1529 | 0.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 4908 | |
| A | 4820 | |
| T | 4401 | |
| S | 2880 | |
| W | 2065 | |
| L | 1943 | 7.2% |
| N | 1013 | 3.8% |
| V | 976 | 3.6% |
| R | 712 | 2.7% |
| H | 530 | 2.0% |
| Other values (15) | 2562 |
| Value | Count | Frequency (%) |
| t | 9174 | |
| d | 9174 | |
| a | 7645 | |
| e | 7645 | |
| s | 7645 | |
| i | 6116 | |
| u | 4587 | 6.2% |
| l | 3058 | 4.2% |
| n | 3058 | 4.2% |
| r | 3058 | 4.2% |
| Other values (8) | 12232 |
| Value | Count | Frequency (%) |
| 1 | 6991 | |
| 2 | 6618 | |
| 4 | 4635 | |
| 3 | 4516 | |
| 0 | 3880 | |
| 7 | 3879 | |
| 9 | 3851 | |
| 8 | 3846 | |
| 6 | 3429 | |
| 5 | 3172 |
| Value | Count | Frequency (%) |
| " | 24464 | |
| ' | 18348 | |
| : | 10703 | |
| , | 7645 | 11.9% |
| . | 3058 | 4.8% |
| Value | Count | Frequency (%) |
| { | 3058 |
| Value | Count | Frequency (%) |
| 22614 |
| Value | Count | Frequency (%) |
| - | 1529 |
| Value | Count | Frequency (%) |
| _ | 1529 |
| Value | Count | Frequency (%) |
| } | 3058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 140823 | |
| Latin | 100202 |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 9174 | 9.2% |
| d | 9174 | 9.2% |
| a | 7645 | 7.6% |
| e | 7645 | 7.6% |
| s | 7645 | 7.6% |
| i | 6116 | 6.1% |
| E | 4908 | 4.9% |
| A | 4820 | 4.8% |
| u | 4587 | 4.6% |
| T | 4401 | 4.4% |
| Other values (33) | 34087 |
| Value | Count | Frequency (%) |
| " | 24464 | |
| 22614 | ||
| ' | 18348 | |
| : | 10703 | 7.6% |
| , | 7645 | 5.4% |
| 1 | 6991 | 5.0% |
| 2 | 6618 | 4.7% |
| 4 | 4635 | 3.3% |
| 3 | 4516 | 3.2% |
| 0 | 3880 | 2.8% |
| Other values (10) | 30409 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 241025 |
Most frequent character per block
| Value | Count | Frequency (%) |
| " | 24464 | 10.1% |
| 22614 | 9.4% | |
| ' | 18348 | 7.6% |
| : | 10703 | 4.4% |
| t | 9174 | 3.8% |
| d | 9174 | 3.8% |
| a | 7645 | 3.2% |
| e | 7645 | 3.2% |
| , | 7645 | 3.2% |
| s | 7645 | 3.2% |
| Other values (53) | 115968 |
CouncilDistrictCode
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.431000654 |
|---|---|
| Minimum | 1 |
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.202296898 |
|---|---|
| Coefficient of variation (CV) | 0.4970202152 |
| Kurtosis | -1.604539498 |
| Mean | 4.431000654 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.03483576817 |
| Sum | 6775 |
| Variance | 4.850111629 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 517 | |
| 2 | 367 | |
| 3 | 184 | 12.0% |
| 4 | 147 | 9.6% |
| 5 | 117 | 7.7% |
| 6 | 100 | 6.5% |
| 1 | 97 | 6.3% |
| Value | Count | Frequency (%) |
| 1 | 97 | 6.3% |
| 2 | 367 | |
| 3 | 184 | 12.0% |
| 4 | 147 | 9.6% |
| 5 | 117 | 7.7% |
| 6 | 100 | 6.5% |
| 7 | 517 |
| Value | Count | Frequency (%) |
| 7 | 517 | |
| 6 | 100 | 6.5% |
| 5 | 117 | 7.7% |
| 4 | 147 | 9.6% |
| 3 | 184 | 12.0% |
| 2 | 367 | |
| 1 | 97 | 6.3% |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 101.3 KiB |
| DOWNTOWN | |
|---|---|
| GREATER DUWAMISH | |
| LAKE UNION | |
| MAGNOLIA / QUEEN ANNE | |
| EAST | |
| Other values (8) |
Length
| Max length | 21 |
|---|---|
| Median length | 9 |
| Mean length | 10.75212557 |
| Min length | 4 |
Characters and Unicode
| Total characters | 16440 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DOWNTOWN |
|---|---|
| 2nd row | DOWNTOWN |
| 3rd row | DOWNTOWN |
| 4th row | DOWNTOWN |
| 5th row | DOWNTOWN |
| Value | Count | Frequency (%) |
| DOWNTOWN | 361 | |
| GREATER DUWAMISH | 324 | |
| LAKE UNION | 142 | 9.3% |
| MAGNOLIA / QUEEN ANNE | 140 | 9.2% |
| EAST | 115 | 7.5% |
| NORTHEAST | 106 | 6.9% |
| NORTHWEST | 77 | 5.0% |
| BALLARD | 61 | 4.0% |
| NORTH | 57 | 3.7% |
| CENTRAL | 44 | 2.9% |
| Other values (3) | 102 | 6.7% |
| Value | Count | Frequency (%) |
| downtown | 361 | |
| duwamish | 324 | |
| greater | 324 | |
| lake | 142 | 5.9% |
| union | 142 | 5.9% |
| magnolia | 140 | 5.8% |
| anne | 140 | 5.8% |
| 140 | 5.8% | |
| queen | 140 | 5.8% |
| east | 115 | 4.8% |
| Other values (8) | 447 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1850 | |
| E | 1691 | |
| A | 1629 | |
| T | 1397 | 8.5% |
| O | 1309 | 8.0% |
| W | 1156 | 7.0% |
| R | 1030 | 6.3% |
| 886 | 5.4% | |
| D | 820 | 5.0% |
| S | 752 | 4.6% |
| Other values (11) | 3920 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15414 | |
| Space Separator | 886 | 5.4% |
| Other Punctuation | 140 | 0.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| N | 1850 | |
| E | 1691 | |
| A | 1629 | |
| T | 1397 | |
| O | 1309 | |
| W | 1156 | 7.5% |
| R | 1030 | 6.7% |
| D | 820 | 5.3% |
| S | 752 | 4.9% |
| U | 671 | 4.4% |
| Other values (9) | 3109 |
| Value | Count | Frequency (%) |
| 886 |
| Value | Count | Frequency (%) |
| / | 140 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15414 | |
| Common | 1026 | 6.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| N | 1850 | |
| E | 1691 | |
| A | 1629 | |
| T | 1397 | |
| O | 1309 | |
| W | 1156 | 7.5% |
| R | 1030 | 6.7% |
| D | 820 | 5.3% |
| S | 752 | 4.9% |
| U | 671 | 4.4% |
| Other values (9) | 3109 |
| Value | Count | Frequency (%) |
| 886 | ||
| / | 140 | 13.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16440 |
Most frequent character per block
| Value | Count | Frequency (%) |
| N | 1850 | |
| E | 1691 | |
| A | 1629 | |
| T | 1397 | 8.5% |
| O | 1309 | 8.0% |
| W | 1156 | 7.0% |
| R | 1030 | 6.3% |
| 886 | 5.4% | |
| D | 820 | 5.0% |
| S | 752 | 4.6% |
| Other values (11) | 3920 |
Latitude
Real number (ℝ≥0)
| Distinct | 1465 |
|---|---|
| Distinct (%) | 95.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.61596899 |
|---|---|
| Minimum | 47.50943452 |
| Maximum | 47.73381054 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 47.50943452 |
|---|---|
| 5-th percentile | 47.5391732 |
| Q1 | 47.58787647 |
| median | 47.61229181 |
| Q3 | 47.6477973 |
| 95-th percentile | 47.70792077 |
| Maximum | 47.73381054 |
| Range | 0.22437602 |
| Interquartile range (IQR) | 0.05992083 |
Descriptive statistics
| Standard deviation | 0.04669993476 |
|---|---|
| Coefficient of variation (CV) | 0.0009807620376 |
| Kurtosis | -0.0442789242 |
| Mean | 47.61596899 |
| Median Absolute Deviation (MAD) | 0.02755594 |
| Skewness | 0.2878549453 |
| Sum | 72804.81658 |
| Variance | 0.002180883907 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.52593209 | 5 | 0.3% |
| 47.66375728 | 5 | 0.3% |
| 47.5829049 | 4 | 0.3% |
| 47.60831575 | 4 | 0.3% |
| 47.52131741 | 4 | 0.3% |
| 47.60387657 | 3 | 0.2% |
| 47.62124083 | 3 | 0.2% |
| 47.64171214 | 3 | 0.2% |
| 47.5616226 | 3 | 0.2% |
| 47.5309583 | 3 | 0.2% |
| Other values (1455) | 1492 |
| Value | Count | Frequency (%) |
| 47.50943452 | 1 | |
| 47.50991348 | 1 | |
| 47.5103068 | 1 | |
| 47.5106034 | 1 | |
| 47.51063848 | 1 | |
| 47.51081215 | 1 | |
| 47.51089546 | 1 | |
| 47.51184247 | 1 | |
| 47.51276131 | 1 | |
| 47.51277739 | 1 |
| Value | Count | Frequency (%) |
| 47.73381054 | 1 | |
| 47.73368341 | 1 | |
| 47.73314624 | 1 | |
| 47.731823 | 2 | |
| 47.73127184 | 1 | |
| 47.72971465 | 1 | |
| 47.72969445 | 1 | |
| 47.72957286 | 1 | |
| 47.72899702 | 1 | |
| 47.72842333 | 1 |
Longitude
Real number (ℝ)
| Distinct | 1450 |
|---|---|
| Distinct (%) | 94.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.3336496 |
|---|---|
| Minimum | -122.4116616 |
| Maximum | -122.2587951 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | -122.4116616 |
|---|---|
| 5-th percentile | -122.37786 |
| Q1 | -122.3428953 |
| median | -122.3332714 |
| Q3 | -122.3230797 |
| 95-th percentile | -122.2927134 |
| Maximum | -122.2587951 |
| Range | 0.15286651 |
| Interquartile range (IQR) | 0.0198156 |
Descriptive statistics
| Standard deviation | 0.02304426883 |
|---|---|
| Coefficient of variation (CV) | -0.0001883722828 |
| Kurtosis | 1.0374564 |
| Mean | -122.3336496 |
| Median Absolute Deviation (MAD) | 0.00998269 |
| Skewness | -0.09642095163 |
| Sum | -187048.1502 |
| Variance | 0.000531038326 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.3308402 | 5 | 0.3% |
| -122.3002168 | 5 | 0.3% |
| -122.3668974 | 4 | 0.3% |
| -122.3354485 | 4 | 0.3% |
| -122.3228994 | 4 | 0.3% |
| -122.3336945 | 4 | 0.3% |
| -122.333786 | 3 | 0.2% |
| -122.3534322 | 3 | 0.2% |
| -122.3337433 | 3 | 0.2% |
| -122.3386303 | 3 | 0.2% |
| Other values (1440) | 1491 |
| Value | Count | Frequency (%) |
| -122.4116616 | 1 | |
| -122.4084254 | 1 | |
| -122.4077751 | 1 | |
| -122.4077007 | 1 | |
| -122.4033599 | 1 | |
| -122.3999978 | 2 | |
| -122.3990624 | 1 | |
| -122.3973945 | 1 | |
| -122.3962337 | 1 | |
| -122.3941507 | 1 |
| Value | Count | Frequency (%) |
| -122.2587951 | 1 | |
| -122.2617596 | 1 | |
| -122.2623899 | 1 | |
| -122.2626508 | 1 | |
| -122.2629464 | 1 | |
| -122.2641702 | 1 | |
| -122.2657673 | 1 | |
| -122.2666646 | 1 | |
| -122.2681394 | 1 | |
| -122.268551 | 1 |
ZipCode
Real number (ℝ≥0)
| Distinct | 28 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98116.1465 |
|---|---|
| Minimum | 98101 |
| Maximum | 98199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 98101 |
|---|---|
| 5-th percentile | 98101 |
| Q1 | 98104 |
| median | 98109 |
| Q3 | 98122 |
| 95-th percentile | 98134 |
| Maximum | 98199 |
| Range | 98 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 15.48591625 |
|---|---|
| Coefficient of variation (CV) | 0.0001578324954 |
| Kurtosis | 8.008767462 |
| Mean | 98116.1465 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 2.138777389 |
| Sum | 150019588 |
| Variance | 239.813602 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 98134 | 194 | |
| 98104 | 165 | |
| 98101 | 149 | 9.7% |
| 98109 | 135 | 8.8% |
| 98108 | 110 | 7.2% |
| 98122 | 83 | 5.4% |
| 98105 | 82 | 5.4% |
| 98121 | 76 | 5.0% |
| 98119 | 63 | 4.1% |
| 98103 | 61 | 4.0% |
| Other values (18) | 411 |
| Value | Count | Frequency (%) |
| 98101 | 149 | |
| 98102 | 30 | 2.0% |
| 98103 | 61 | 4.0% |
| 98104 | 165 | |
| 98105 | 82 | |
| 98106 | 23 | 1.5% |
| 98107 | 51 | 3.3% |
| 98108 | 110 | |
| 98109 | 135 | |
| 98112 | 15 | 1.0% |
| Value | Count | Frequency (%) |
| 98199 | 18 | 1.2% |
| 98178 | 2 | 0.1% |
| 98177 | 1 | 0.1% |
| 98155 | 2 | 0.1% |
| 98146 | 2 | 0.1% |
| 98144 | 41 | 2.7% |
| 98136 | 3 | 0.2% |
| 98134 | 194 | |
| 98133 | 51 | 3.3% |
| 98126 | 21 | 1.4% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| OSEBuildingID | DataYear | PrimaryPropertyType | YearBuilt | NumberofBuildings | NumberofFloors | PropertyGFABuilding(s) | PropertyGFAParking | LargestPropertyUseType | ENERGYSTARScore | SiteEnergyUse(kBtu) | TotalGHGEmissions | Location | CouncilDistrictCode | Neighborhood | Latitude | Longitude | ZipCode | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2015 | Hotel | 1927 | 1 | 12.0 | 88434 | 0 | Hotel | 65.0 | 6981428.0 | 249.43 | {'latitude': '47.61219025', 'longitude': '-122.33799744', 'human_address': '{"address": "405 OLIVE WAY", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.612190 | -122.337997 | 98101.0 |
| 1 | 2 | 2015 | Hotel | 1996 | 1 | 11.0 | 88502 | 15064 | Hotel | 51.0 | 8354235.0 | 263.51 | {'latitude': '47.61310583', 'longitude': '-122.33335756', 'human_address': '{"address": "724 PINE ST", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.613106 | -122.333358 | 98101.0 |
| 2 | 3 | 2015 | Hotel | 1969 | 1 | 41.0 | 961990 | 0 | Hotel | 18.0 | 73130656.0 | 2061.48 | {'latitude': '47.61334897', 'longitude': '-122.33769944', 'human_address': '{"address": "1900 5TH AVE", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.613349 | -122.337699 | 98101.0 |
| 3 | 5 | 2015 | Hotel | 1926 | 1 | 10.0 | 61320 | 0 | Hotel | 1.0 | 28229320.0 | 1936.34 | {'latitude': '47.61421585', 'longitude': '-122.33660889', 'human_address': '{"address": "620 STEWART ST", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.614216 | -122.336609 | 98101.0 |
| 4 | 8 | 2015 | Hotel | 1980 | 1 | 18.0 | 107430 | 12460 | Hotel | 67.0 | 14829099.0 | 507.70 | {'latitude': '47.6137544', 'longitude': '-122.3409238', 'human_address': '{"address": "401 LENORA ST", "city": "SEATTLE", "state": "WA", "zip": "98121"}'} | 7 | DOWNTOWN | 47.613754 | -122.340924 | 98121.0 |
| 5 | 9 | 2015 | Other | 1999 | 1 | 2.0 | 60090 | 37198 | Police Station | NaN | 12051984.0 | 304.62 | {'latitude': '47.6164389', 'longitude': '-122.33676431', 'human_address': '{"address": "810 VIRGINIA ST", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.616439 | -122.336764 | 98101.0 |
| 6 | 10 | 2015 | Hotel | 1926 | 1 | 11.0 | 83008 | 0 | Hotel | 25.0 | 6252842.0 | 208.46 | {'latitude': '47.6141141', 'longitude': '-122.33274086', 'human_address': '{"address": "1619 9TH AVE", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.614114 | -122.332741 | 98101.0 |
| 7 | 11 | 2015 | Other | 1926 | 1 | 8.0 | 102761 | 0 | Other - Entertainment/Public Assembly | NaN | 6426022.0 | 199.99 | {'latitude': '47.61290234', 'longitude': '-122.33130949', 'human_address': '{"address": "901 PINE ST", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.612902 | -122.331309 | 98101.0 |
| 8 | 12 | 2015 | Hotel | 1904 | 1 | 15.0 | 163984 | 0 | Hotel | 46.0 | 12633744.0 | 331.61 | {'latitude': '47.60258934', 'longitude': '-122.33255325', 'human_address': '{"address": "612 2ND AVE", "city": "SEATTLE", "state": "WA", "zip": "98104"}'} | 7 | DOWNTOWN | 47.602589 | -122.332553 | 98104.0 |
| 9 | 15 | 2015 | Hotel | 1969 | 1 | 11.0 | 133884 | 19279 | NaN | 48.0 | 14719853.0 | 576.63 | {'latitude': '47.60712147', 'longitude': '-122.33431932', 'human_address': '{"address": "1101 4TH AVE", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.607121 | -122.334319 | 98101.0 |
Last rows
| OSEBuildingID | DataYear | PrimaryPropertyType | YearBuilt | NumberofBuildings | NumberofFloors | PropertyGFABuilding(s) | PropertyGFAParking | LargestPropertyUseType | ENERGYSTARScore | SiteEnergyUse(kBtu) | TotalGHGEmissions | Location | CouncilDistrictCode | Neighborhood | Latitude | Longitude | ZipCode | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1519 | 49926 | 2015 | College/University | 1925 | 1 | 3.0 | 428347 | 0 | College/University | NaN | 36367960.0 | 1253.31 | {'latitude': '47.61676902', 'longitude': '-122.3215492', 'human_address': '{"address": "1701 BROADWAY", "city": "SEATTLE", "state": "WA", "zip": "98122"}'} | 3 | EAST | 47.616769 | -122.321549 | 98122.0 |
| 1520 | 49940 | 2015 | Hospital | 1920 | 1 | 8.0 | 374466 | 0 | NaN | 97.0 | 78652064.0 | 3894.01 | {'latitude': '47.60984009', 'longitude': '-122.3274412', 'human_address': '{"address": "925 SENECA ST", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 3 | EAST | 47.609840 | -122.327441 | 98101.0 |
| 1521 | 49945 | 2015 | Senior Care Community | 1989 | 1 | 3.0 | 167300 | 0 | Senior Care Community | NaN | 3681105.0 | 70.38 | {'latitude': '47.60895084', 'longitude': '-122.3421375', 'human_address': '{"address": "1531 WESTERN AVE", "city": "SEATTLE", "state": "WA", "zip": "98101"}'} | 7 | DOWNTOWN | 47.608951 | -122.342138 | 98101.0 |
| 1522 | 49946 | 2015 | Supermarket/Grocery Store | 2010 | 1 | 8.0 | 41198 | 0 | Supermarket/Grocery Store | 64.0 | 6879291.0 | 75.28 | {'latitude': '47.67057565', 'longitude': '-122.3866853', 'human_address': '{"address": "5700 24TH AVE NW", "city": "SEATTLE", "state": "WA", "zip": "98107"}'} | 6 | BALLARD | 47.670576 | -122.386685 | 98107.0 |
| 1523 | 49958 | 2015 | Other | 2014 | 1 | NaN | 20993 | 0 | Repair Services (Vehicle, Shoe, Locksmith, etc) | NaN | 912558.0 | 12.28 | {'latitude': '47.59524558', 'longitude': '-122.3229473', 'human_address': '{"address": "848 7TH AVE S", "city": "SEATTLE", "state": "WA", "zip": "98134"}'} | 2 | GREATER DUWAMISH | 47.595246 | -122.322947 | 98134.0 |
| 1524 | 49966 | 2015 | Other | 2009 | 1 | NaN | 40265 | 0 | Pre-school/Daycare | NaN | 1957356.0 | 42.40 | {'latitude': '47.54102707', 'longitude': '-122.31249237', 'human_address': '{"address": "4520 M L KING JR WAY S", "city": "SEATTLE", "state": "WA", "zip": "98108"}'} | 2 | SOUTHEAST | 47.541027 | -122.312492 | 98108.0 |
| 1525 | 49985 | 2015 | Large Office | 2014 | 1 | 6.0 | 257986 | 169195 | Office | 99.0 | 16730779.0 | 210.69 | {'latitude': '47.6233466', 'longitude': '-122.33968176', 'human_address': '{"address": "500 9TH AVE N", "city": "SEATTLE", "state": "WA", "zip": "98109"}'} | 7 | LAKE UNION | 47.623347 | -122.339682 | 98109.0 |
| 1526 | 49998 | 2015 | Self-Storage Facility\n | 2014 | 1 | 4.0 | 87576 | 14004 | Self-Storage Facility | NaN | 850568.0 | 12.40 | {'latitude': '47.5705386', 'longitude': '-122.2914015', 'human_address': '{"address": "3736 RAINIER AVE S", "city": "SEATTLE", "state": "WA", "zip": "98144"}'} | 2 | SOUTHEAST | 47.570539 | -122.291402 | 98144.0 |
| 1527 | 50002 | 2015 | Other | 2014 | 1 | 3.0 | -50550 | 84198 | Parking | NaN | 1389553.0 | 9.69 | {'latitude': '47.66411096', 'longitude': '-122.3166394', 'human_address': '{"address": "4741 11TH AVE NE", "city": "SEATTLE", "state": "WA", "zip": "98105"}'} | 4 | NORTHEAST | 47.664111 | -122.316639 | 98105.0 |
| 1528 | 50038 | 2015 | Mixed Use Property | 2014 | 1 | 2.0 | 25532 | 0 | Office | 84.0 | 628609.0 | 4.38 | {'latitude': '47.66199875', 'longitude': '-122.3867569', 'human_address': '{"address": "2360 W COMMODORE WAY", "city": "SEATTLE", "state": "WA", "zip": "98199"}'} | 7 | MAGNOLIA / QUEEN ANNE | 47.661999 | -122.386757 | 98199.0 |